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Since it is not clear, in general, how an automaton should best be charac- 
terized, one of the purposes of this paper is to find ways to go from one 
characterization to another. In doing so, we have not been completely impar- 
tial — the programming approach has been emphasized more than the others. 
There are perhaps two reasons for this emphasis: First and the more ob- 
vious one is the closeness between theoretical programming discussed here 
and programming of digital computers. Secondly, the programming approach 
has provided a way of looking at automata that seems to make certain ideas 
less obscure — the construction of a universal program in Section III of this 
paper is one such example. In the theory of finite automata, Theorem 3 is 
an attempt to unify the ideas of complete and partial automata, which have 
generally been treated separately in the past. 

I. INTRODUCTION' 

The invention of modern computers seems to have been anticipated 
by many years by Turing. 1 Yet it is remarkable how little the progress of 
computers has been influenced by Turing's work. There is, perhaps, a 
basic difference in viewpoint that may account for this lack of conver- 
gence. Turing looked at machines from the point of view of their internal 
behavior. Although Turing originated the concept of universal machines, 
his idea seems to correspond much closer to that of our special-purpose 
machines. Every machine, by virtue of its state description, performs a 
specific task; a machine is altered only if its internal structure is altered. 
Computers, on the other hand, are generally specified in terms of their 
external capabilities. Their internal structure remains more or less fixed 
once they come into being. A computer is then a universal machine in 
disguise, and every Turing machine corresponds to a particular com- 
puter program. One may therefore study the behavior and structure of 
programs rather than work with states. 

The first step in this direction was perhaps taken by Wang, 2 who based 
his ideas of machines on a computer (which he called a B-machine) that 
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had four kinds of instructions: move to the right or left; mark; transfer 
conditionally. A B-machine is close to the ultimate in simplicity, but is 
still capable of computing everything that a Turing machine is capable 
of and, with a suitable program, is capable of being universal. 

As a model, B-machines are attractive because of their intrinsic sim- 
plicity. On the other hand, because a B-machine does not have the 
ability to erase, it is very difficult to write even fairly simple programs 
without having to work out intricate details. In this paper we have, 
therefore, introduced a modified B-machine— one which is em- 
powered with the ability to erase. We have called a machine of this 
kind a W-machine. 

The similarities and differences between W-machines and two-symbol 
Turing machines are shown in Sections II and III. In Section IV we 
describe the construction of a universal W-machine to show the kinds of 
techniques involved in W-machine programming. It may be interesting 
to note here that, once a few useful subprograms are written, the main 
linkage program takes but a few instructions. Because of its simplicity, 
one may suspect that it is harder to construct sophisticated combina- 
torial or symbol-operation kinds of programs on a W-machine than it is 
on a more complex computer. But we would not be surprised if such a 
suspicion turns out to be groundless; what makes a W-machine a poor 
computer may well be only its disregard for time. 

The subfamily of W-machines in which each machine has a bounded 
memory constitutes the family of finite automata. Because finite auto- 
mata are abstract models of sequential switching circuits, there has been 
much current interest in their behavior. As a result, there have been a 
number of approaches to problems in connection with finite automata. 
In Section V it is shown that finite automata may be characterized by 
the deletion of one of the five kinds of W-machine instructions. There is 
thus a program analog of finite automata. 

In Section VI the relation between finite automata and sets of input 
sequences is discussed. Among other things we present within our frame- 
work a result of Kleene 3 that makes it possible to represent finite auto- 
mata by algebraic-like expressions. This characterization seems very 
natural in many ways, except that the expressions can easily get very 
lengthy. The problem of how best to handle these expressions appears 
very intriguing and, as far as we know, is quite open. 

II. TURING MACHINES 

A machine will be called an A-machine if it consists, aside from its 
control mechanism, of the following: 
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i. A one-way potentially infinite tape (say infinite to the right) di- 
vided into squares. Each square can either he marked (having in it the 
symbol 1) or erased (having in it the symbol 0), and 

ii. A reading and writing head that scans some square of the tape at 
any discrete moment of time. Since the tape is finite to the left, the 
machine is assumed to stop if the read-write head is ordered to go to 
the left of the leftmost square of the tape. 

The content c of the tape of the A-machine previous to the initial 
moment of time, consisting of a finite sequence of zeros and ones, is 
called the (tape) input to the A-machine. As time advances, the tape 
content would change unless some stable condition is reached, so that 
we would get a sequence c of tape contents (c , Ci , ■ ■ •), where Cj is a 
later tape content than c, if i < j, and where d 7^ c i+i . The sequence c 
is called the external behavior of the A-machine relative to the tape 
input Co . Two A-machines are said to be completely equivalent if they 
have identical external behaviors relative to all tape inputs. That is, 
two A-machines are completely equivalent if they cannot be distin- 
guished by anyone observing just the sequence of tape contents. 

The idea of complete equivalence is too stringent at times. If an A- 
machine is used to compute values of a function, what the machine does 
while it is processing its data is, in a sense, irrelevant as long as the final 
answer turns out to be the desired answer. We will, later on, also con- 
sider a less stringent type of equivalence. 

The fact that an A-machine has a potentially infinite tape implies 
that it has an indefinitely large memory. It might be helpful to keep the 
notion that the tape is finite at any moment, but that at any moment a 
finite amount of blank tape may be added to the right whenever such a 
demand arises. In the same way, it is helpful to note that every input 
is a finite sequence of zeros and ones. We will, however, speak of the 
null input, meaning a string of zeros indefinitely long. The null input 
corresponds to an indefinitely long blank tape. 

We will consider the following model of a Turing machine, hereafter 
called a T-machine, as one of the A-machines. In addition to being an 
A-machine, it has k active internal states qi , q-< , • ■ ■ , qk and an inactive 
state q in which the machine is assumed to stop. The machine can have 
one of the following combination of actions: erase or mark the square 
under scan; move the read-write head one square to the left or one 
square to the right; go into some state qj . A T-machine is completely 
specified if its combination of actions is specified for every state of the 
machine and each of the two symbols under scan, and if the initial 
state and the initial square under scan are given. 
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For instance, the following one-state (i.e. one active state) T-machine, 
if started initially scanning a square in the interior of its tape, will have 
its read-write head swinging buck and forth, changing ones to zeros while 
going in one direction and changing zeros to ones while going in the 
other direction. The read-write head will either proceed indefinitely to 
the right or will eventually stop at the leftmost square. In this and later 
description of A-machines, we will use the letter m to denote the action 
of marking the square under scan ; e for the action of erasing the square 
under scan; + for the action of moving the read-write head one square 
to the right of the square under scan; and - for the action of moving 
the read-write head one square to the left : 





Symbol 







l 


*q 


'", +, 8 


e, -, q 



Here q designates the single active state of the T-machine, and * denotes 
the fact that q is also the initial state of this machine. If the square 
under scan is not marked, a mark is put in it, the read-write head moves 
one square to the right, and the machine returns to state q. If the square 
under scan is marked, it is then erased, the read-write head moves one 
square to the left, and the machine again returns to state q. 

From now on, we will at times use the notation q t ; m or e, -(-or — , 
qj ; m or c, -f- or — , g* for each combination of actions of any T-machine. 
Thus, the combination of actions of the one-state T-machine in question 
can be written: q; m, +, q; e, —, q. 

111. W -MACHINES 

A W-machine is an A-machine together with a program made up of 
an ordered list of the following five types of base instructions: (a) e: 
erase the square under scan; (b) m: mark the square under scan; (c) + : 
move the read- write head one square to the right; (d) - : move the read- 
write head one square to the left; and (e) t{A): transfer to program 
address A if the square under scan is marked, otherwise transfer to the 
next program address on the ordered list. These base instructions are 
executed in order by a control mechanism. The initial program address 
and the initial square under scan arc given. 

A program of a W-machine consisting of all base instructions with 
each instruction having a separate address is called a base program. Let 
us consider a W-machine completely equivalent to the one-state T- 
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machine illustrated earlier. The base program for this machine is 

1. t7 7. e 

2. m 8. - 

3. + <). a 

4. tl 10. m 

5. m 11. 12. 
(). t'2 

We note that the instructions in the program refer to only two ad- 
dresses, address 2 and address 7. The program may therefore he equally 
well written 

1. S3 

2. m, +,m,m,t2 

3. c, -, <3, nt, /2, 

where the instructions contained in one line arc understood to he exe- 
cuted consecutively. This notation simplifies the writing of W-machine 
programs and will be used in this paper wherever it is convenient to do 
so. 

A base program of a W-machine is said to be minimal if there is no 
W-machine completely equivalent to it with fewer base instructions in 
its program. In order not to have to consider special cases later, let us 
agree at this stage to rule out certain trivial redundancies in W-machine 
programs. Consider two W-machines, Wi and W-> , as follows: 

Machine Wi Machine \V 2 

1. in 1. w 

2. + 2. e 

3. ti 3. m 

4. + 

5. tl. 

Machines Wi and W 2 are not completely equivalent, since they have 
nonidentical external behavior. The difference is, however, of a minor 
nature. We will therefore agree that, whenever a W-machine program 
contains consecutive instructions 



A. 


e or m 


A + 1. 


e or m 


A + i. 


e or m, 
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only the last instruction (in address A + t) will be retained, and the 
others will be deleted. Furthermore, if it should become necessary to 
mark and erase a square in succession, the final symbol in that square 
will be accepted as the output symbol for that moment. 

The fact that a base program is minimal itself implies that the base 
program cannot contain certain subprograms. 

Lemma 1: 

Let P be a minimal base program of a W-machine. Then P cannot 
have two consecutive addresses A and A + 1 having in them the fol- 
lowing base instructions: 

(i) A. t(B) (iu)A. e 

A + 1. t(C); A + 1. e; 

(ii) A. e (iv) A. m 

A + 1. t(C); A + 1. m. 

Proof: In (i) and (ii), if address A + 1 is never referred to, P cannot 
be minimal since the (A + l)th instruction can be deleted. On the 
other hand, if there is some instruction t(A + I) in P, such an instruc- 
tion can be changed to t(C), again making the (A + l)th instruction 
superfluous. This proves (i) and (ii); (Hi) and (iv) are obvious, and the 
lemma follows. 

Theorem 1: 

I. Given a W-machine having b base instructions, there is a completely 
equivalent T-machine with not more than b states. 

II. Given a T-machine with s states, there is a completely equivalent 
W-machine with not more than 10s + 1 base instructions. 

Proof: Let a W-machine with b base instructions be given. That there 
is a completely equivalent T-machine is clear. It remains for us to show 
for part I of the theorem that b states would suffice. 

Let P be a minimal base program for the W-machine and A be the 
initial address of P. Then, by Lemma 1, the base instructions in ad- 
dresses A and i + 1 are one of the following: 

(i) A. t{B) (iv) A. more 

A + 1. e; A + 1. + or -; 

(ii)A. t(B) (v)A. +or- 

A + 1. m; A + 1. more; 

(iii) A. t(B) (vi) A. + or - 

A + l. +or-; A + 1. +or-. 
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In (i), (ii) and (iii), we assert that the base instruction in address B 
can be made one of the following: 

(a) B. e or (b) B. -for-. 

This is true because, if the instruction in B should be t(C) and the in- 
struction in C should be t(D) and so on, then at some point in the chain, 
say address E, the instruction must be a nontransfer instruction, for 
otherwise the program would not have been minimal. We may then 
replace the instruction t(B) in A by t(E). On the other hand, if the 
instruction in B should be m, then the instruction in A could have been 
replaced by: 

A. t(B + 1); 

and the assertion follows. 

In (i) and case (a), by Lemma 1, the base instructions in addresses 
A + 2 and B + 1 must be + or - . Thus, address A can be associated 
with a T-machine state 

q(A); c, + or -, q(A + 3); e, + or -, q(B + 2). 

Similarly, in case (b) address A can be associated with a T-machine 
state 

q(A); a, + or -, q(A + 3); m, + or -, q(B +1). 

It should be noted that a T-machine state may replace more than 
just address A. For example, in (i) case (a) the T-machine state replaces 
the five addresses A, A + 1, A + 2, B and B + 1 if none of these ad- 
dresses is referred to elsewhere in the program. Therefore, in going from 
a W-machine to a T-machine as described by the procedure outlined 
here, the T-machine will in general have fewer than b states. 
In (ii), the (.4 + 2)th instruction can be either 

A + 2. + or - or A + 2. t(C). 

The former is no different from (i). In the latter, the instruction in 
address C can be made one of the following : 

C. e or C. + or — . 

The T-machine states to be associated with address A in case (a) cor- 
responding to these two subcases are respectively 

q(A); e, + or -,g(C + 2); e, + or -, q(B + 2), 
and 

q(A); m, + or-,g(C+l); e, + or -,g(B + 2); 
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and in case (b) are respectively 

q(A); e, + or -,q(C + 2); m, + or -, q(B + 1), 

and 

q(A); m, + or -, q(C + 1); m, 4- or -, q(B 4- 1). 

Case (iii) is similar to (i). In (iv) and (v), the two addresses, A and 
A + 1 , can obviously be associated with a single T-machine state. In 
(vi), each address A or .4 + 1 may be associated with a single T-ma- 
chine state. Therefore, there is a completely equivalent T-machine with 
not more than b states and part I of the theorem follows. 

To prove part II, let a T-machine with s states be given with states 
q it i =- 1,2, •■-, s: 

qr, o,-(0), &,-(()), g.-(0); a,(l), b<(l), 8*(1), 

where a, is either m or e and 6, is either + or — . Associate with each 
state q,- two addresses .4 , and A , of a W-machine : 

Ai. o,(0), 6 f (0), t[A'i{0)],m,t[Ai -(0)1; 
ill. o^iJ.WD.pIdJUrti^i)]. 

Next, if oy is the initial state of the T-machine, we will add an initial 
address A > — 1 where we have 

A 3 - 1. t(A'j). 

The W-machine so denned is completely equivalent to the T-machine, 
having exactly 10s + 1 base instructions. This proves part II of the 

theorem. 

The bound 10s 4- 1 on the number of base instructions cannot be 
lowered if the first address is to be always the initial address of a W- 
machine program. If we are allowed to begin a program at some inter- 
mediate address, the bound 10s 4- 1 can be lowered to perhaps 8s 4- 1- 

From this result, it follows that whatever is true about T-machines 
is functionally true about W-machines, and conversely. The choice of 
whether to use the T-machine or the W-machine model is therefore 
somewhat arbitrary. We have found that the T-machine model is con- 
venient for state description of finite automata (Section V) and the 
W-machine model more satisfactory for problems involving operations 
with symbols. The latter contention is illustrated by a universal W-ma- 
chine described below. 
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IV. A UNIVERSAL W-MACHINE. 

A very interesting result of Turing 1 was his construction of a machine 
which is capable of imitating any target machine when given a descrip- 
tion of the target machine. Such a machine is known as a universal 
Turing machine. To our knowledge there have been two independent 
and very ingenious constructions of universal machines which greatly 
simplified Turing's work. The earlier one is due to Moore 4 and the other 
to Ikeno. 5 Moore's machine has two symbols and three tapes, and can 
be reduced to nine states. Ikeno's machine requires six symbols and 
ten states, giving a state-symbol product of 60. 

A word should perhaps be said in regard to the two extra tapes in the 
case of Moore's machine and the four extra symbols used in Ikeno's 
machine. In either case, the universal machine is just slightly different 
from the target machines it imitates. It would be more "authentic" for 
a universal machine to be immediately within the class of all target 
machines it imitates. It also seems that it is as direct to construct such 
an "authentic" machine as otherwise. For these reasons, we include here 
the construction of a universal W-machine as an example. 

Let U denote the universal W-machine to be constructed. Let the 
squares on the tape of U be divided into two classes: a-squares and b- 
squares. If the squares are numbered beginning with one for the left- 
most square and proceeding to the right, then the a-squares are the 
odd-numbered squares and the 6-squares the even-numbered squares. 
The 6-squares are there to serve as markers. The description of the 
target machine together with the data occupies only the a-squares. In 
order to clarify coding, the contents of a-squares arc underlined. Thus 
I would mean a mark and a blank in adjacent a-squares, the content 
of the inbetween ^-square has been left unspecified. 

The W-machine instructions are coded as follows: 

Instruction Code 

* (Stop) 1 

+ II 

III 
m 1111 

e I I I I I 

t(n) 1 1 1 1 1 1 1 ••• 1. 

n ones 
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The program and data of the target W-machine that the universal 
W-machine U is to imitate occupy only the a-squares on the tape of U. 
The instructions are coded in sequence, with a single blank a-square 
separating adjacent instructions. The data go directly into a-squares 
without modification. There is a single blank a-square between the last 
instruction and the data. 

The first two a-squares are blank and all a-squares to the right of the 
data are blank. The 6-squares are all marked except for (a) the first 
6-square, (b) the 6-square immediately to the right of the data square 
under scan and (c) all 6-squares to the right of square x, where x is the 
a-square to the right of the last data a-square. 

The coding scheme will be made clear by an example. Suppose the 
program of the target W-machine is 

1. 13 

2. + 

3. m 

4. t2, 

where the initial address is address 1 and the data are 

i 
10 11 

where the third symbol is the initial symbol under scan. In the coded 
form, the tape of U would have contents 

Begin 
1 
1 111111111111111 1 Q 1 1 1 1 1 

tz + 

01111111110111111111111111 



m 



t'2 



Symbol under scan 

1110 110 1 10 

Data 

The program for the universal W-machine U is divided into a main 
program P and a number of subprograms. The various subprograms are 
designated by symbolic addresses as follows : 
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RT One square to the right. 

LT One square to the left. 

MK Mark square under scan. 

Ell Erase square under scan. 

TR Transfer if data square under scan is marked. If transfer is 

effective, go to the beginning of tape and hunt to the right 
until the correct instruction has been found. Otherwise, go 
to the next instruction. 

RTZ Right to zero. 

LTZ Left to zero. 

R.DZ Right to double zero. 

LDZ Left to double zero. 

The program for U begins with the main program P. It first examines 
the instruction to be carried out. If the instruction should be + , — , 
m or e, the program enters subprograms RT, LT, MK or ER respectively. 
If the instruction should be t(n), the program enters subprogram TR. 

Let us begin with the basic subroutines RTZ, LTZ, RDZ and LDZ : 

RTZ 1. +2, t\. 

LTZ 1. -2, t\. 

RDZ 1. +2, t\, +2, a. 

LDZ 1. -2, tl, -2, a. 

Next the subprograms TR, RT, LT, MK and ER: 

TR 1. +, e, RTZ, -,t2, -, *(I,T3), 

2. LDZ, +4, 

3. c, +, RTZ, m, +, t4, + , LTZ, -, RDZ, m, t(P), 

4. +, e, LTZ, -, RDZ, m, RTZ, +2, e, LDZ, m, /3. 
LT 1. +, e, RTZ, m, -2, 

2. e, 

3. LTZ, m, +, «(P). 

RT 1. +, e, RTZ, m, +2, m, t(LT2). 

MK 1. +, e, RTZ, -, m, *(ER2). 

ER 1. +, e, RTZ, -,e, 

2. -,*(LT3). 

Finally, the main program P: 

P 1. +2, 12, *, 

2. +2, IS, RT, 

3. +2, «4, LT, 

4. +2, «5, MK, 

5. +2, /(TR), ER. 
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After the following sequence of instructions of the target machine has 
been executed: 



*3, m, 12, + , 



the tape contents read: 



Begin 
i 
11111111 1 1111111 10 11111 

«3 + 

Begin 
instruction m 

I 

01111111110111111111111111 

v ^ -' V — — v — *' 

m t2 

Symbol 
under scan 
1 
01110111 10000 ■■■ 



Data 

We will call a target W-machine admissible if its read-write head never 
goes to the left of the leftmost square on tape. Machine U then imitates 
all admissible target machines and is itself admissible. 

It may be interesting to note that the coding for machine U does not 
make an intrinsic distinction between program and data. The burden 
of distinguishing which is program and which is data is therefore on the 
coder. 

Using the conversion procedure discussed in the proof of Theorem 1, 
there is a T-machine completely equivalent to the W-machine U with 
about 74 internal states. f The program for U itself requires some 125 
base instructions. As things go, it is not impossible for someone to im- 
prove our result to a 50 base instruction universal W-machine or a 25- 
state universal T-machine or perhaps even better. The answer to the 
problem of finding a universal machine with the smallest state-symbol 
product posed by Shannon 6 seems to be quite remote, even for two- 
symbol machines. 



tSome of the ideas that resulted in this construction were due to D. Younger, 
who indicated a possible reduction to a machine of about 56 states. 
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V. FINITE AUTOMATA 

There is a subfamily of T-machines that are abstract models of a 
class of switching circuits called sequential circuits. The dominant trait 
of these machines is a strictly limited memory, so that they are called 
finite automata. (These machines are also known as sequential machines.) 
Because of their limited memory, rather simple tasks lie beyond the 
reach of finite automata. For instance, there is no finite automaton that, 
having the null input and ejecting symbols one at a time, will give us 
the successive digits of t or, for that matter, any number that is not 
rational. On the other hand, many decision problems become finite 
problems for finite automata; in fact, in some cases efficient algorithms 
have been found. 

A two-symbol finite automaton consists of 
i. A finite number of internal states qa,qi , • • • , q„ . 
ii. An alphabet of two symbols: s = 0, Si = 1. 

iii. A map M whose domain and range are both subsets of the set of 
stale-symbol pairs. If M is defined for a state-symbol pair (5, , Sj), 
then M(qt , Sj) is another pair {q k , s r ). The symbol 8j is called an 
input symbol. The symbol s r is called an output symbol, and is 
completely determined by g, ; that is, s r is independent of the 
input symbol Sj . 
iv. An initial state q n , which can reach every state q, , < i ^ n, 

via some suitable input sequence of symbols. 
In the definition of a finite automaton given above, we included those 
automata in which the map M may be undefined for some state-symbol 
pairs (q t , Sj). We will call such automata partial automata. Partial 
automata in the past have been treated somewhat differently from 
complete automata. By considering certain input sequences called 
acceptable sequences, we will be able to treat partial and complete 
automata on a uniform basis. 

5.1 Finite Automata and W*-Machines. 

In the beginning of this section we mentioned that finite automata can 
be regarded as a subfamily of T-machines, and hence as a subfamily of 
W-machines. Let us call a W-maehine a W*-machine if the base program 
of the W-machine docs not contain the instruction " — "; that is, if the 
read-write head of the W-machine never moves to the left. We will see 
that, by suitable interpretation of inputs and outputs, every finite 
automaton is completely equivalent to some W*-machine and, further- 
more, that every W*-machine differs from some finite automaton by at 
most a unit of delay in the output. 
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Let S be a finite automaton; S may then be considered as a T-machine 
in the following sense : An input sequence of symbols to S corresponds to 
having this sequence of symbols on the tape of the T-machine, beginning 
with the initial input symbol on the leftmost square of the tape. In 
operation, the T-machine begins by scanning the initial square, writes 
the output symbol on the square being scanned, moves one square to the 
right and goes into its next state. At any moment, therefore, the previous 
output is contained in the square just to the left of the read -write head, 
and the present input is contained in the square directly under the read- 
write head. In this way the read-write head of the T-machine never 
moves to the left. It follows from Theorem 1, therefore, that there is a 
W-machine whose program consists of no base instruction of the form 
" - " and is such that this W-machine and the T-machine are completely 

equivalent. 

Conversely, suppose a W*-machine is given. By Theorem 1, there is a 
T-machine completely equivalent to this W*-machine such that its 
read-write head never travels to the left. Such a T-machine may not be 
in the form of a finite automaton since its output symbol may be a func- 
tion of both the input symbol and the current state of the machine. We 
wish to show therefore that such a T-machine differs from a finite 
automaton by at most a unit of delay in the output. 

Consider a T-machine whose read-write head never travels to the 
left. It then consists of states of the following kind: 

where a t and &* are either e or m. In the particular case a. = b> for some 
i, the output becomes in no way dependent upon the input. We will 
therefore consider only those states q< for which a, p* h . 

Let us now form a new T-machine by splitting each such state q t of 
the original T-machine into two states, q i0 and qn , such that we have 
for the new machine, 

Qio ; a« i +, 9/o ; a. i +, Vn 

and 

qn ; bi,-\-,qko) &i , +, 9" i 

and, if q should be the initial state of the original T-machine, add a new 
state q * as the initial state of the new machine: 

go*; c, +, 500; e, +, g i • 

In operation, the new machine imitates the original machine faithfully, 
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except that the output of the new machine is delayed by a unit of time ; 
that is, the present output of the new machine is the previous output 
of the original machine. We have therefore 

Theorem 2: Every finite automaton with s states is completely equiva- 
lent to a W*-machine with not more than 10s + 1 base instructions. 
Every W*-machine with b base instructions differs from a finite automa- 
ton of not more than 26+1 states by at most one unit of delay in the 
output. 

An Example. Consider the following W*-machine: 



1. m 

2. + 

3. m 

4. t8 

5. e 



C>. + 

7. t2 

8. + 

9. + 
10. t5. 



This W*-machine is completely equivalent to a five-state T-machine 
with initial state Qii 



State 


Symbol 





1 


*3« 

33 
</7 
</9 
3l0 


m, +, q 9 

C, +, ?9 

e, +, qio 
e, +, stop 


m, +, § 7 
m, +, q 9 
m, + , q 3 
m, +, </io 



The T-machine is not in the form of a finite automaton, since its output 
symbols depend on both the state and the input symbol. Let us there- 
fore split each state whose output symbol is different for different input 
symbols into two states and, in addition, define a new initial state qo*. 
The machine then becomes: 







Symbol 




State 











1 


*„ * 


e, +, 3i.o 






e, +, ?i. i 


01.0 


e, +, 5s 






e, +, 33 


01.1 


'», +l 37.0 






m, +, q 7 ,i 


q 3 


'»> +, 39.0 






m > +7 39.1 


57.0 


e > + > 59.0 






e, +, 39.1 


57.1 


'«> +, 33 






m, +, q 3 


5s.o 


e, +, 3io 






e, +, 310 


39.1 


m, -t-, qm 






m, +, 3io 


3io 


e, +, stop 






e, +, 37.1 
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This machine is identical with the original W*-machine except that 
its output symbols are delayed by a unit of time and its initial output 
symbol is always a zero. For the same input, the sequence of the tape 
contents of the two machines are therefore not exactly the same; the 
tape content of the new machine to the left of the read-write head is 
the tape content of the original machine to the left of the read-write 
head translated one square to the right. The tape contents of the two 
machines to the right of the read-write head are, of course, the same. 

Since the output of a finite automaton depends only on its state, and 
since the symbol + is redundant, the state-symbol table of a finite 
automaton can be simplified. For instance, the nine-state machine given 
in the example can be given by : 





Symbol 


Output 







1 


01,0 

01,1 

qx 
07. o 

07.1 

0a. o 

09.1 
010 


01.0 

03 

07.0 

09. n 

09.0 
03 
010 
010 

stop 


01.1 

03 

07.1 

09,1 

09,1 

03 

010 

010 

07.1 






1 
1 



1 



1 





For complete automata, except for including the initial state in our 
model, this description is the same as that given by Moore. 7 In the same 
way, the description of the five-state T-machine in the example which 
is completely equivalent to the original W*-machine can also be simpli- 
fied. We may write 





Symbol 


Outputs 







1 


*01 

03 

(n 

09 

</ll> 


03 
09 
09 
010 

stop 


07 

09 

03 

010 

07 




1 






1 

1 
1 
1 





where to each state may be associated two output symbols, one for each 
input symbol. This description is essentially the model of sequential 
machines used by Huffman 8 and Mealy. 9 It is quite clear from the fore- 
going that there is a close relationship between these two models, and 
that one may go freely from one to the other, f 

t Another way of relating models of finite automata is discussed by Cadden. 10 
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5.2 Finite A utomata with a Minimum Number of States 

A problem of interest to switching circuit designers is finding finite 
automata having a smallest number of states. In relay circuit design, 
for example, the number of relays needed is usually a monotone function 
of the number of states the circuit has. For such circuits, therefore, the 
number of states becomes in a way a measure of cost. 

Let .4 be a partial finite automaton. A finite sequence is said to be an 
acceptable sequence for A if there is an output sequence and a terminating 
state when this sequence is presented as the input sequence to .4, with 
.4 beginning in its initial state. We will call the set of all acceptable 
sequences for A the acceptable set for .4 and denote this set by R(A). 
Now let A and B be two partial finite automata and let the intersection 
Rah = R(A) a R(B) be called the common acceptable set for .4 and B. 
Then A and B are said to be completely equivalent with respect to R AB if, 
for all input sequences belonging to R AB , A and B give identical output 
sequences. If R is a subset of R AB , then- equivalence of A and B with 
respect to R is defined similarly. It is clear that this definition of com- 
plete equivalence is the same as that given before for T- and W-machines, 
except the input sequences are now restricted to just the acceptable 
sequences. 

As an example consider A and B defined as 

B: 





A 


: 






Symbol 




State 




Output 











1 




*«o 


Ol 




1 


"i 




fli 






State 


Symbol 


Output 





l 


*6o 

6, 


b, 


6. 


1 






The acceptable set R(A) for .4 is the set of all finite sequences JO, 01, 
01 1, 0111, • • • ) and the acceptable set R(B) for B is the set of finite 
sequences { 1, 11, 1 10, 1 101, 11010, 110101, • • • ). There is no sequence 
that is acceptable to both .4 and B. The common acceptable set R AB is 
therefore empty. 

Theorem 3: Let -4 and B be two partial finite automata with a and b 
states respectively, where a, b > 1. Let R AB be the common acceptable 
set for A and B and let R AB (l) be the subset of R AH such that every 
sequence in R A n(l) is of length ^/. Then .4 and B are completely equiva- 
lent with respect to R A „ if and only if they are completely equivalent 
with respect to R AH {1) for / = ab — 2. 

Before going through the proof, it would be helpful to discuss some 
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notations that will be used later. If A is a finite automaton, its initial 
state will be denoted by a . If q is any state of A, we will denote the 
output symbol associated with the state q by w(g). Moreover, let s = 
(so , Si , ■ • • , s m _i) be an acceptable input sequence for A. Then we will 
at times let a* stand for the state reached by A after receiving the tth 
symbol of s. It will be convenient here to speak of the motion diagram 
for A : 

Input symbols: s Si • • • s m -i 

Transition of states of A : a n a x a-> • • • Om-i a* . 

Proof: The theorem is clear in one direction. In the other direction, let 
A andB be completely equivalent with respect to the set R A ii(ab - 2); 
that is, A and B will give identical output sequences to every commonly 
acceptable sequence of length not greater than ab - 2. 

Let us now suppose that there is a common acceptable sequence s = 
(so , Si , • ■ • , s m -i) of minimum length m where m > ab — 2 such that 
in the motion diagram for A and B we have 

So Si • • ' S nl -l 

/\ /\ /\ 

a «1 02 • • • Ctm-l dm 

/\ /\ /\ 

b bi b-i • • • b„,-i b,„ 

where 

»((!<) = «(&,-) for i - 0, 1, • ■ • , m - 1 but w(a m ) 5^ <a(b m ) . 

There are now two cases to consider. The case m > ab - 1 is simpler and 
will be left to the reader. 

Let us therefore assume m = ab - \. In the motion diagram above, 
we have then exactly ab pairs of states: (a , 6o), (<*i » &0i ' ' ' > ( a >» > &«)■ 
First, suppose that these a6 pairs are not distinct; that is, suppose 
(a, , bi) = (a,- , b^ for some ^ i < j ^ m. The motion diagram then 
becomes 

So • • • Si • • • S; • • • Sm-1 

/\ /\ /\ /\ 

a fli • • • a, Of+i • • • ay 0/+i ■ ■ ■ a-m-i a m 

/\ /\ /\ /\ 

b bi ••• bi b i+ i ■ ■ ■ bj bj+i • • ■ bm-i b m • 

Consider the common acceptable sequence s* = (s , • • • , s,_i , s ; - , 
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• • • , s„,_i), which is of length /, where I < m = ab — 1. Since a>(a m ) ^ 
w(&m), A and B would give different output sequences to the input 
sequence s*, contradicting our hypothesis that A and B are completely 
equivalent with respect to R A b (ab — 2). We must therefore assume that 
the ab pairs of states (oo , &o), • ■ ■ , (a,„ , b m ) are distinct, and thus in- 
clude every possible pair of states of A and B. 

Now let a m and b m be states of A and B respectively such that a m ?± 
a m and & m y£ b m . Then we assert u(a m ) ^ a>(a m ) and io(b m ) ^ co(b m ). For, 
if co(a m ) = co(a m ), then co(a,„) ^ w(6 m ). This is impossible, however, since 
the pair (a m , b m ) is one of the ab distinct pairs of states. The same argu- 
ment shows u(b m ) t* o)(b m ). We have now then the inequality u(a m ) 9^ 
<a(b m ). But again this is impossible. This concludes the proof. 

Although we cannot say that the bound ab — 2 is the best for all 
pairs (a, b), we will show that ab — 2 is very close to the best we can 
hope for. To do this we will now exhibit a pair of families of finite auto- 
mata. 

Consider first a family of finite antomata [A m },m ^ 1, as follows: 







Symbol 






State 








Output 



















1 




*«o 

ai 


«2 













dm-l 
Am 


a m 






Oo 


6 




Next, define a family of finite automata {B,,\, n ^ 1, as follows: 



State 


Symbol 


Output 





1 


*b 

6. 


60 

bi 
L-i 


6, 
62 

L 





6 
1 



For any pair of automata (A m , B„), one from each family, the set 
RA m B B of all commonly acceptable sequences consists of all sequences 
each of which must be of the form 



••• 1 



1 







1, 



m 0's 



fflO's 



mO's 
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since these are the only sequences acceptable to A m . For these two finite 
automata (A m , B n ), the minimum length of any input sequence in 
RA m B„ that would cause A m and B„ to give different output sequences 
would be ab — min (a, b), where in this case a = m + 1 and b = n + 1. 
For instance, the motion diagram for the pair (A 2 , B s ) would be 

10 10 1 

/\ /\ /\ /\ /\ /\ /\ /\ /\ 
a «i tt« flo en «2 fflo «i &i o-o 

/\ /\ /\ /\ /\ /\ /\ /\ /\ 
b b u b a bi bi bi fo 2 b 2 6 2 b 3 . 

Since 63 is the only state of B 6 that gives an output symbol of 1 , we see 
that the input sequence (0 1 1 1) is the first such sequence 
that causes A 2 and B A to give different output symbols. 

In general, by the same construction, we find that given two finite 
automata, one from each of these families, no input sequence of length 
less than ab — min (a, b) would enable us to tell them apart. We there- 
fore have 

Theorem J/.: Theorem 3 would not hold if I were made less than 
ab — min (a, b). 

In particular, we note that Theorem 3 implies Theorem 4 for the case 
min (a, b) = 2. For the cases min (a, b) > 2, there may be some slight 
improvement f possible for Theorem 3. 

Actually, Theorem 3 is interesting for another reason. It is essentially 
a theorem showing the existence of a decision procedure for finding 
finite automata with a minimum number of states. Historically, the 
problem of finding finite automata with a minimum number of states 
was studied and solved in a rather special way. Thus, both Moore 7 and 
Huffman 8 gave ingenious procedures for state minimization of complete 
finite automata. It was not uncommon for people to assume that these 
procedures also worked for partial automata before the introduction of 
several interesting counter-examples by Ginsburg. 12 As we see from 
Theorem 3, much of the earlier confusion was probably due to a disre- 
gard of the idea of acceptable sequences. 

VI. FINITE AUTOMATA DEFINED BY INPUT SEQUENCES 

Up to now we have shown that finite automata can be described in two 
different ways. In the definition given in the previous section, a finite 

t In the paper by Rabin and Scott," a theorem similar to Theorem 3 was ob- 
tained for the family of complete automata. In view of the fact that they were 
dealing exclusively with complete automata, their theorem could be considerably 
improved. 
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automaton is characterized essentially by its state-symbol table. On the 
other hand, one may characterize a finite automaton by giving its W*- 
machine program. The latter characterization illustrates the close paral- 
lel between computer programming and logical design. In this section, 
following the earlier work of Kleene, 3 we will consider a third characteri- 
zation of partial finite automata. This characterization leads to a very 
interesting algebraic-like structure for finite automata. Our purpose here 
is to connect this characterization with (he others. Much of the work 
along the approach of Kleene had been pursued and simplified by Myhill 13 
and Rabin and Scott. 11 The interested reader may refer to these papers 
and other unpublished work by Myhill. 

Let A be a finite automaton. A finite input sequence to A is said to 
be a signal sequence for A if this input sequence causes A to terminate 
in a state whose output is the symbol 1. The set of all signal sequences 
for a finite automaton A is called the signal set for A, and is denoted by 
T(A). 

Given a finite automaton A, the signal set T(A) is uniquely defined. 
On the other hand, if signal sets are to represent finite automata, it 
would be most desirable that two "different" automata have different 
signal sets. Let us consider automata A and B given by 

A: B: 



State 


Symbol 


Output 





1 


*a 
a. 


d 


a. 


1 






Symbol 




State 




Output 











1 




*6o 




6 2 


1 


fci 




6 2 





h 


h 








If nothing is said about input sequences, one may say that A and B are 
different, since every input sequence acceptable to A is unacceptable to 
B and vice versa, although A and B both have the empty set as their 
signal set. In order to have a clear-cut correspondence between signal 
sets and finite automata, we must therefore restrict ourselves to accept- 
able sequences. 

Theorem 5: Let A and B be two finite automata and R AB the common 
acceptable set for A and B. Then A and B are completely equivalent 
with respect to R AB if and only if A and B have the same signal set. 

Proof: From the definition of signal set, it is clear that, if A and B 
are completely equivalent with respect to R AB , then T(A) = T(B). 
Now suppose ,4 and B have the same signal set but are not completely 
equivalent with respect to R A b ■ Then there is some input sequence 
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, Si , • 



, s„_i in R AB giving the motion diagram 



So Si • • • s„_i 
/\ /\ /\ 

«0 fll 02 " ' ' &„_i &„ 

b bi b 2 



b n -i b n 



such that co(a») ^ u(6„); that is, the output symbols associated with 
states a„ and 6„ are different. Since we are considering only two-symbol 
automata, it is clear that the input sequence s , Si , • • • , s„_i cannot 
be a signal sequence for both A and B. The proof now follows from this 
contradiction. 

We see from this that signal sets indeed represent finite automata. In 
many ways this is a rather natural characterization. For example, con- 
sider a sequential lock on a vault. The vault can be opened only if a 
given sequence s of symbols is applied to the lock. Any other sequence 
of input symbols may cause the lock to go into an alarm state. In this 
case, we may consider the lock as a finite automaton denned by the one- 
element signal set {s\. 

There are other situations, however, where it seems simpler to describe 
a finite automaton by its W*-machine program or its state-symbol table. 
It is therefore not clear in general how a finite automaton is best charac- 
terized; as far as we can tell, a great deal depends on personal taste. 
The next best thing one can do, therefore, is to find ways to go from one 
form of characterization to another. 

We will begin by redefining several operations on sets of finite se- 
quences due to Kleene. Let X and Y be two sets of finite sequences; 
X v Y is then the set union of X with Y. By XY, called the string 
product of X with Y, we mean the set of all concatenated finite sequences 
of the form xy with x £ X y 6 Y. Finally, by the closure of the set X, 
denoted by X*, is meant the set 

X* = v X v XX v XXX v • • • , 

where is the empty set. 

To illustrate the use of these operations, let us consider the following 
automaton A : 



State 


Symbol 


Output 





1 


*a 
ai 


ai 


a 

flo 




1 
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The signal set of .4 is then given by 

T(A) = 1*0(1 1*0)*, 
where we have used the notations and 1 to stand for the one-element 
sets {0} and jl}. 

In general, to find the signal set for some finite automaton A is not 
as straightforward as this example indicates. We will describe below 
one such procedure, t 

Let A be a finite automaton with k states a , ai , • • • , at-i . Then 
by P(a,i , dj) we mean the set of all finite sequences such that beginning 
with state a, , each of these sequences causes A to terminate in state a, . 
Furthermore, let us denote by P(a, , ay ; ay) the set of all finite sequences 
such that, beginning with state a, , each of these sequences not only causes 
A to terminate in state a, , but also never causes A to pass through state 
(tj . In other words, it is permissible for a, = ay , but in the chain of 
states ai , ■ • • , a } ■ , the state a, must not appear other than at either 
end. Then it is clear that 

Lemma 2: Pifli , Oy) = P(a t , ay ; Oy) [P(aj , a, ; ay)]*. 

More generally, let s , Si , • • • , s„_i be a sequence in P(di , dj) with 
the motion diagram 

&'o Si • • ■ s n —i 

a, ay, dj., • • ■ dj n _ l dj . 

We denote by P{di , ay ; a', , a', , • • • , a' m ) a subset of P(di , dj) such 
that a sequence So , Si , • • • , s„_i is in P(a,- , ay ; d il , a,- 2 , • • • , a, m ) if 
and only if the two sets of states (a,-, , a ;,-, , • • • , a, m ) and (ay, , ay, , • • • , 
ay„_,) are disjoint. In other words , P(a t - , ay ; di l , a,- 2 , • • • , a,,,,) is the 
set of finite sequences such that, beginning with state a, , each of these 
sequences not only causes A to terminate in state ay but also causes A 
never to go through states a,, , a, 2 , • ■ • , a, m . It is permissible, how- 
ever, for a, or ay to be one of the states a,-, , a,-, , • • • , a, m . 

Lemmd 3: Let A be a finite automaton with k states do , di , • • • , 
a*_i . Then, for all pairs of states a, , ay , and for all m, 1 ^ m ^ k — 1, 

P(a,-, fly; fl', , a' 2 , • • • , fl,J = 
P(«,- , ay; a!, , • ■ • , a lm+1 ) v P(a,- , a, m+1 ; a,, , • • • , a im+1 ) 

[/ J (o.-,n + , » a»m+i J o!i . • ' ' » fl L +1 )] * P(o*»+i i a i ; °m i • • • » «••>»+.)• 

Proof: Suppose that an input sequence belongs to the set on the left- 
hand side. Then this sequence causes A to either go through state a, m+I 

t In an unpublished report shown to me by H. Wang, I found a similar result 
worked out independently by R. McNaughton and H. Yamada. 
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or it does not. If it does not, then it clearly belongs to P{ai , a, ; a ix , 
■ " * > a im+i)- If ^ does, then it belongs to the second set on the right 
hand side. Conversely, suppose a sequence belongs to the set on the right 
hand side. Then it clearly belongs to P(ac , a, ; a,-, , • • • , a,- m ), and the 
proof follows. 

Combining the two lemmas, we get 

Theorem 6: Let A be a finite automaton with k states Gto , «i , • • ■ , 
Ofc_i . Let a be the initial state of A and a ri , a T „ , • • • , a r „ be all those 
states of A whose output symbol is one. Then the signal set for A is the 
union 

n 

T(A) = V P(a Q , a Ti ; a ri ) [P(a Ti , a r< ; a ri )]*, 



which can be obtained by repeated application of Lemma 3. 
As an illustration, let us consider the automaton A below: 







Symbol 






State 








Output 



















1 




a 


a 






Oi 


1 


«1 


Ol 






a 2 


1 


a- 


o» 






Oi 






By Lemma 2, we have 

T(A) = P(a , a ; a ) [P(a , a ; a )]* 

v P(a , ai ; ai) [P(ai , ai ; Oi)]*. 



Now 



Therefore, 



P(do , a ; a ) = 0, 

P(a , ai ; ai) = 0* 1, 

P( ai , ai ; oi) = v 1 0* 1. 

T(A) = 0* v 0* 1 (0 v 1 0* 1)*. 



The expressions for signal sets can get very lengthy. The problem of 
reducing the length of these expressions without recourse to an exhaus- 
tive search appears very difficult and intriguing. 
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The next problem we will consider is how to give a state-symbol 
characterization of signal sets. The procedure we will describe here is a 
modification of the abstract ideas of Rabin and Scott" and Myhill. 13 

Let us begin this discussion of several examples. Let .4 and B be the 
following finite automata: 



A: 



B: 



State 


Symbol 


Output 


State 


Symbol 


Output 





1 





l 




«i 






1 


bi 


b B 


6i 

bo 




1 



with signal sets T(A) = 1* (1 1* 0)* and T(B) = (0 v 1) [(0 v 1) 
(0 v 1)]*. 

Example 1. Suppose we wish to construct an automaton C such that 
r(C) = T(A) v T(B). We begin by defining a set of new states (ao , b ), 
(oo , b\), (ai , 6 ), (ai , bi), some of which may turn out to be superfluous. 
The state (a , bo) is defined to be the initial state of C. Beginning with 
the state (ao , &o), we can construct a part of G: 



State 


Symbol 


Output 


1 





(oo , bo) 


{ax , 6,) 


(ao , bi) 


w(a ) Vw(bo) = 



where, if we let Mo denote the function taking state-symbol pairs to 
states for the automaton C and w be the function taking states to output 
symbols, then 

M c [(oo , 6o), 0] = (M A (a , 0), M B (b , 0)), 

Mc[(a , 6 ), 1] = (M A (ao , 1), M B (b , 1)) 

and 



a>[(ero , bo)] = w(a ) v u(b ). 

In this process, we reached two new states (ai , b\) and (ao , b t ). Con- 
tinuing the process, we eventually get for C the state-symbol table 
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Symbol 






State 








Output 





1 


*(a , b ) 


(ai , 61) 






(«to , 61) 





(a , 61) 


(oi , b ) 






(oo , 60) 


1 


(ai , 61) 


bo 






(a , b ) 


1 


(01 , b ) 


bi 






(a , bi) 


1 


bo 


fc. 






bx 





&! 


&o 






bo 


] 



Let us suppose that an input sequence s , s x , 
T(A) and gives the motion diagram 



s„_i belongs to 



So Si 

a a n a,. 



S n -1 

/\ 



Then the same sequence would give rise to a chain of states of C such 
that the terminal state of this chain must be (a,„ , bj) for some state bj 
of B. Since 



(««„ , &;) = «(o*») v "(by) = 1, 



r(C). 



it follows that this input sequence belongs to T(C) and r(A) 
In the same way, we may show that T(B) C T(C). 

Conversely, if a sequence in T(C) gives a chain of states of C: (ao , 60), 
(a,-, , b,,), • • • , (a<„ , bi n ), then either w(a, n ) = 1 or a)(fc, n ) = 1. There- 
fore, this input sequence is either in T(A) or T(B), and thus r(^) v 

r(B) = r(C). 

Example 2. We wish to construct a finite automaton C such that 
T(C) = T(A)T(B). We begin with the initial state of A as the initial 
state for C. Now, whenever a state of A is reached whose output symbol 
is a 1, we must then allow C the opportunity to imitate the behavior of 
B. In such cases, therefore, new states may be created. Thus, a part of 
the state-symbol table for C would be 



State 




Symbol 




Output 





1 


*a 


61 


do 
(a , 61) 




1 



The state (a , bi) is defined by (M A (a,i , 1), M B (bo , 1)). In this way, the 
new state allows C to imitate immediately the behavior of state bo of 
B. Also, if either u(a ) = 1 or ui(bi) = 1, then w(a , 61) = 1. We may 
therefore continue this process to get for C the state-symbol table: 
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Symbol 










Output 







1 


*ao 


ai 


ao 





ai 


bi 


(a , bi) 


1 


61 


bo 


bo 





60 


b, 


bi 




(a , 61) 


(ai , b ) 


(a , b ) 




(ai , b ) 


bi 


(a , bi) 




(a , 60) 


(ai , bi) 


(ao , 61) 





(ai , bi) 


(bi , b ) 


(a , b , bi) 




(61 , bo) 


(bo , 61) 


(bo , 61) 




(a , &o , bi) 


(ai , 60 , bi) 


(a , b , bt) 




(ai , 6 , bi) 


(bi , 60) 


(a , b , bi) 





where we see that (ai , 60 , &i) = (01 , &i). 

The process can be formulated as follows: If (a,- , • • • , ay , bk , • • • , 
b m ) is a new state, then 

M c [(ai , • • ■ , ay , b* , ■ • • , &,„), x] 

= (Mc(a,-, 3), ■ ■ ■ , M c (a,-, x), M B (bk, x), ■ • • , ilf B (6 m , a;)) 

where a; is either or 1, and co[(a,- , • • • , ay , 6* , • • • , &„,)] = co(a») v 
• • • v w(/>fc) v • • • v w(6,„). Also, if a,- is any state such that w(a») = 1, 
then il/ c (a,- , x) = (M A {fl< , x), M B (bo , x)). For all other a; and for all 
bk we have 

M c (fli , x) = M A (a } - , x), 

M c (bk , x) = M B (bk , x), 

where x is again either or 1. 

Example 3. We wish to construct an automaton C such that T(C) = 
fr(A)]*. The idea here is that whenever a state of A is reached whose 
output symbol is a 1, we must allow C the opportunity to begin again 
at state ao of A . Furthermore, since the empty sequence is a member of 
r(C), it is necessary to define for C a new initial state Co whose output 
symbol is 1 . Following this line of thought, we see that the state-symbol 
table for C is 







Symbol 






State 








Output 



















1 




*Co 


a 






ao 


1 


ao 


ai 






ao 





ai 


ai 






a 


1 
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In general, the process is formulated as follows. If (a,- , • • • , ay) is a new 
state, then 

M Sfli , • • • , ay), x] = (M c (a,i ,x),---, M c {aj , x)), 

where x is either or 1 and 

o}[(ai , • • • , dj)\ = o)(ai) v • • • v co(ay). 

If a» is any state of A whose output symbol is 1, 

M c {ai , x) = (M A (ai , x), M A (a , re)), x = or 1. 

For all other states ay of A , 

M c {(ij , x) = M A (dj ,x), x = or 1. 

The ideas of conversion from signal sets to state-symbol table for 
a finite automaton are all contained in these examples. Since to state a 
theorem means a repetition of what we outlined in the examples, we will 
content ourselves with the following form of Kleene's result. 3 

Remark. Let T(A) be a set of finite sequences built up from the opera- 
tions union, string product and closure operating on a finite set of finite 
sequences. That is, T(A) is given by a finite expression involving the 
operations union, string product and closure. Then, following the proce- 
dures outlined in Examples 1, 2 and 3, a finite automaton can be con- 
structed having T(A) as its signal set. 

This remark, together with Theorem 6, thus provides the two-way 
linkage between finite automata and signal sets. 

VII. CONCLUDING REMARKS 

We have discussed three approaches to a theory of automata and finite 
automata : the state-symbol table model, the W-machine program model 
and the signal-set model. Of these, we are most intrigued by the pro- 
gramming model. This approach not only resembles strongly computer 
programming, but it also offers possibilities of symbol operation and 
other combinatorial programs, all based on a very simple and elegant 
program structure. (One other model not studied here is a system pro- 
posed by Post.) It is quite possible a combination of these systems may 
offer deeper insight into the global structure of programming and auto- 
mata which is lacking at present. 
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